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ABSTRACT 

The 21cm forest - HI absorption features in the spectra of high-redshift radio sources 
- can potentially provide a unique probe of the largely neutral intergalactic medium 
(IGM) during the epoch of reionization. We present simulations of the 21cm forest due 
to the large scale structure of the reionization-era IGM, including a prescription for x- 
ray heating and the percolation of photoionization bubbles. We show that, if detected 
with future instruments such as the Square Kilometer Array (SKA), the 21cm forest can 
provide a significant constraint on the thermal history of the IGM. Detection will be 
aided by consideration of the sudden increase in signal variance at the onset of 21cm 
absorption. If radio foregrounds and the intrinsic source spectra are well understood, 
the flux decrement over wide bandwidths can also improve detection prospects. Our 
analysis accounts for the possibility of narrow absorption lines from intervening dense 
regions, but, unlike previous studies, our results do not depend on their properties. As- 
suming x-ray heating corresponding to a local stellar population, we estimate that a 
statistically significant detection of 21cm absorption could be made by SKA in less 
than a year of observing against a Cygnus A-type source at z <~ 9, as opposed to nearly 
a decade for a significant detection of the detailed forest features. We discuss observa- 
tional challenges due to uncertainties regarding the abundance of background sources 
and the strength of the 21cm absorption signal. 

Key words: large-scale structure of Universe - cosmology: theory - quasars: absorp- 
tion lines - galaxies: high-redshift - intergalactic medium. 



1 INTRODUCTION 

In both theoretical and observational astronomy today, a great 
deal of attention is being focused on improving our understand- 
ing of the epoch of reionization, when luminous sources be- 
gan to ionize the intergalactic medium (IGM) for the first time 
after recombination. Understanding the process of reionization 
would lend insight into not only the nature of the first sources 
themselves, but also the evolution of the IGM, the build-up of 
large-scale structures, and the radiative transfer processes oc- 
curring at high redshifts. Currently, the most powerful data sets 
for constraining the ionization history of the IGM are the Gunn- 
Peter son absorption in the Lyman-a forest dBolton & Haehneltl 
l2007h and the electron-scattering optical d epth as observed i n 
the cosmic microwave background (CMB) dLarson et al.l201oT) . 
The former places a lower limit on the fraction of hydrogen 
which is neutral (limited by the fact that the Gunn-Peterson 
trough saturates at a neutral fraction of ~ 10 ) and the lat- 
ter measures the integrated level of ionization between the ob- 
server and the CMB. Thus we find a range of redshifts during 
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which the universe went from substantially neutral to substan- 
tially ionized, but the detailed history of the reionization process 
is difficult to examine with these methods. 

The hyperfine transition of ground-state neutral hydrogen 
(with a rest-frame wavelength of 2 1cm) has the potential to offer 
much more detailed information about the epoch of reionization 
than either the Lyman-« forest or the CMB optical depth mea- 
surement. Since 21cm radiation is the result of a low-energy 
transition at radio wavelengths, it can free-stream to the ob- 
server without being substantially absorbed in the intervening 
IGM. Also, most studies of the 21cm transition at high redshift 
focus on the signal in absorption or emission against the CMB, 
so the mean IGM itself rather than rare overdensities can be 
probed. 

There has been a great deal of recent progress in observa- 
tion and instrumentation aimed at the search for the redshifted 
21cm signal from the epoch of reionization. Several large-scale 
radio interferometers are currently being assembled to constrain 
the redshift of reionization as well as to produce all-sky bright- 
ness temperature measurements and a 21cm fluctuation power 
spectrum. These instruments include the Low Frequency Array 
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(LOFArQ in the Netherlands, the Giant Metrewave Radio Tele- 
scope (GMRTf| in India, the Experiment to Detect the Global 
EoR Signal (EDGEsfl the Precision Array to Probe the Epoch 
of Reionization (PAPERfl and the Murchison Widefield Array 
(MWA0 in Western Australia. Planning is also underway for 
the Square Kilometre Array (skaJI 

In this work, we discuss how studies of 21cm absorption 
against high-redshift radio sources can be used to study the evo- 
lution of the IGM around the epoch of reionization in what 
is known as the "21cm f orest" dCarilli, Gnedin & Owenll2002l : 
iFurlanetto & Loebll2002h . In Section [2] we discuss how obser- 
vations of the 21cm forest have potential advantages over other 
probes. We describe in Section[3]our simulation of the expected 
signal using an IGM evolution code. In Section [4] we discuss 
our parameterization of the x-ray heating of the IGM, and in 
Section[5] we discuss uncertainties regarding the population of 
high-redshift sources that could potentially be used as back- 
ground sources for 21cm forest observations. Using a hypothet- 
ical radio-loud high-redshift source as an example, we present 
sample 21m forest spectra in Section|6] In Section]?] we show 
how statistical detection methods can be used to isolate a faint 
signal in the case that radio-loud sources are rare at high red- 
shift. We present our conclusions and outlook for future work 
in Section[8] 



2 21CM FOREST 

The optical depth to 21cm absorption of neutral hydrogen along 
the line of sight to a distant radio source depends primarily upon 
the density, temperature and ionization state of the gas. For a 
cloud of gas at redshift z, with overdensity 5 = (p/po) — 1 
(where p is the density and po the mean density), neutral frac- 
tion xhi, an d spin temperature Ts, the optical depth to 21 cm 
absorption is l lFurlanetta Oh & Briggj|2006l : IXu et all 20091) : 
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Here, vq is the observing frequency, h p is Planck's constant, c 
is the speed of light, Aio is the Einstein A coefficient, fcs is 
the Boltzmann constant, nu is the number density of hydrogen 
atoms, v\\ is the radial velocity of the gas, r« is the distance to 
the gas and H(z) is the Hubble parameter. The factor in square 
brackets accounts for the peculiar velocity of the gas; for the 
purpose of this calculation, we will assume the gas is in the 
Hubble flow. The spin temperature of the gas is defined by 
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where T 7 is the temperature of the CMB, T c is the color tem- 
perature of the Lyman-a background, Tk is the gas kinetic tem- 
perature, and x c and x a are coupling coefficients for collisions 
and the Wouthuysen-Field effect, respectively. 

If a radio source is observed to exist in or before the epoch 
of reionization, the structure of the neutral hydrogen gas along 
the line of sight to the source should be apparent in 21cm ab- 
sorption in the radio spectrum. Several distinct features may 
be seen: a flux decrement blueward of the redshifted 21cm ab- 
sorption frequency, whose depth depends upon the mean opti- 
cal depth of the IGM at that redshift; small-scale variations in 
flux due to fluctuations in the density and ionization state of the 
IGM along the line of sight; transmission windows due to pho- 
toionized bubbles along the line of sight; and deep absorption 
features due to dense neutral hydrogen clouds in dwarf galax- 
ies and minihalos. The spectral features due to 21cm absorption 
are known, collectively, as the 21cm forest, in analogy to the 
Lyman-a forest. In contrast to the Lyman-a forest, however, 
the 21cm forest occurs at significantly higher redshifts (z > 7) 
and contains detailed information about the IGM even when 
the IGM is largely neutral. In the Lyman-a forest, the Gunn- 
Peterson trough reaches zero flux when the ionized fraction of 
the intervening gas reaches approximately 1 part in 10 5 . Also, 
the 21cm forest has the potential to probe very small structures 
(limited primarily by the frequency resolution of the instrument) 
at lower overdensities than the Lyman-a forest. Much of the in- 
formation in the 21cm forest will come from structures that are 
still in the linear regime, potentially making it possible to probe 
the matter power spectrum more directly than would be possible 
via the Lyman-a forest. 

As we discuss below, the utility of the 21cm forest as a 
diagnostic of the IGM depends upon both the evolutionary his- 
tory of the IGM in heating and ionization and the abundance of 
appropriate background sources at high redshift. We show that, 
should sufficiently bright background sources be found, obser- 
vations of the 21cm forest with instruments such as the SKA 
can be useful in constraining the thermal history of the IGM as 
well as probing the growth of structure at high redshift. 



3 NUMERICAL SIMULATIONS 

To simulate the 21cm forest spectrum, we combine the results 
of two numerical simulations. The first is a time-evolution code 
tracking the mean properties of a uniform IGM over cosmic 
time, and the second a non-uniform two-phase IGM simulation 
describing small-scale spatial inhomogeneities . The mean IGM 
code i s an adaptation of a code developed by iMack & Weslevl 
J2008I) . It includes cosmological expansion, adiabatic cooling, 
spin temperature coupling with the CMB at high redshift, and 
x-ray heating of the gas via a star fo rmation rate derived from 
calculations in lGeil& Wvithel < l2008l) . The relationship between 
th e star-formation rate (SFR) and the x-ray luminosity is defined 
bv lFurlanettol ( l2006bl) : 



L x = 3.4 x 10 40 /js 
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1 M 



yr - 
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where fx is an x-ray efficiency factor. We assume a spectral in- 
dex of a — 1.5 in the frequency range between 0.2 keV and 10 
keV for x-ray sources. The factor fx is essentially a free param- 
eter. In this section, we assume fx = 1, but in subsequent sec- 
tions we vary this quantity. (See Section|4]for more details.) The 



© 2011 RAS, MNRAS 000,[T]{9] 



Detecting the 21cm forest 3 



two-phase IGM code was developed bv lGeil&WvitheU2008h 
and includes a semi-analytic prescription for star formation and 
subsequent photoionization. The size of the simulation box is 
70 cMpc cubed, with 360 pixels per sidelength. We piece to- 
gether four separate (arbitrarily chosen) ID slices through the 
simulation box in order to include a range of local environments 
through which the line of sight may pass, giving a total of 280 
cMpc. Due to the nature of this study as a discussion of the fea- 
sibility of future measurements rather than a detailed quantita- 
tive analysis of simulation results, a dedicated, high-resolution 
simulation of the IGM was not necessary. 

For the purpose of illustrating the basic properties of the 
21cm forest for a source at high redshift, we first evolve the uni- 
form IGM from high redshift to z = 7 and calculate the optical 
depth to 21cm absorption for the range 7 < z < 25. Then, 
using the mean gas temperature and ionization fraction of the 
uniform IGM, combined with the density fluctuations and HII 
regions from slices through the non-uniform two-phase simu- 
lation, we produce a simulation of the small-scale variations in 
21cm absorption in a restricted range from z = 8 to z = 9. 
This allows us to produce 21cm forest spectra against a hypo- 
thetical z — 9 radio source over a wide range of frequencies, 
with x-ray heating of the gas taken into account. Large spatial 
inhomogeneities in the gas temperature are unlikely to signifi- 
cantly alter our results, as the the mean free path of the x-rays 
responsible for the heating is generally comparable to or greater 
than the size of the non-uniform simulation box. 

Note that our simulation includes large scale features of the 
IGM such as photoionized bubbles, but does not include col- 
lapsed structures such as galaxies and minihalos, which could 
be expected t o produce deep absorption spikes in the 21cm for- 
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some estimates as to the number and size of abs orbers that 
should be found in the IGM at these redshifts (e.g., IFurlanettol 
l2006al) , but these depend strongly on modelling not only small- 
scale overdensities and their gas properties, but also their heat- 
ing and photoevaporation. In this study, since our main interest 
is in the mean properties of the IGM, the presence of small- 
scale absorption features would complicate the mean-IGM sig- 
nal, rather than being of interest as features themselves. To ac- 
count for the effect absorption lines would have in observations 
of a distant source spectrum, we have developed a procedure for 
stochastically inserting absorption lines into the spectra of our 
hypothetical sources. This method is discussed in more detail in 
Section[6] 

In Figures [JJ and [2] we plot selected quantities output by 
the IGM simulations. Figure [Tal shows the mean ionized frac- 
tion in the uniform-IGM simulation. The simulation has been 
calibrated for the overlap phase to occur at redshift z — 6. 
Reionization in this calculation is achieved primarily through 
the growth of photoionized regions due to star formation, al- 
though the small contribution from x-ray heating is also taken 
into account. For the purpose of this study, we do not present 
alternative reionization histories or morphologies, but rather 
consider an example in which our calculations can be used to 
test the feasibility of future measurements. It has been shown 
dMcOuinn et al.ll2007l) that the neutral hydrogen fraction is the 
most important determinant of the structure of ionized regions 
during reionization; therefore, it should be straightforward to 
generalize our basic results by referring to the IGM ionization 
evolution we present in Figure [Tal 



In Figure Qb] we plot the evolution of the kinetic tempera- 
ture of the gas (Tk , solid line) along with the CMB temperature 
(T 7 , dotted line) in the uniform-IGM simulation. The spin tem- 
perature of the gas (Ts) is coupled to the CMB at high redshift 
and then to the gas temperature by z < 17. In our calculations 
of the optical depth, we assume Ts — Tk, which is a good ap- 
proximatio n at the redshifts wh ere a detection of the 21cm forest 
is feasible iSantos et al.ll2008l) . In Figure [Tc] we plot the mean 
21cm optical depth as a function of redshift from the uniform- 
IGM simulation. In this plot, the optical depth r is somewhat 
overestimated for z > 17 due to the Ts = Tk approximation 
l lSantos et alj2008h . 

In Figures[2a]and[2b] we plot the overdensity 5 and the neu- 
tral fraction xhi, respectively, in the non-uniform simulation 
region as a function of redshift. The algorithm used to calculate 
overdensity allows for values 8 < —1; we have removed these 
unphysical overdensities by placing a cutoff at S — — 1, The 
plot of xhi is on a linear scale to emphasize an important fea- 
ture of the non-uniform simulation: the presence of photoion- 
ized bubbles. We have filled the region below the curve to show 
the contrast between regions of low ionization (which appear 
solid) and fully-ionized bubbles (which show up as white gaps). 
These bubbles have a distinct signature in the 21cm absorption 
spectrum against a background source, which we will discuss in 
more detail in Section [7Tl 



4 X-RAY HEATING EFFICIENCY 

The details of the heating of the IGM over cosmic history are 
still fairly unconstraine d |Mitra i Chgufliur y & Ferrari l20ld : 
IPritchard & Loebl 120081 : IFurlanettol l2006bl) . To account for 
possible differences in the thermal history of the IGM, we 
parametrize the heating by the x-ray efficiency factor fx, de- 
fined in equation (0- This is a free parameter, relating to the effi- 
ciency with which the x-rays produced by star formation couple 
to the IGM. We take fx to be anywhere from 0.01 to 100, which 
is con sistent with other recent works, e.g., IPritchard & Loebl 
l20T(il) . In the local universe, fx ~ 1- Although a direct link 
between fx and a specific physical model of star formation 
and radiative transfer is not straightforward, constraining fx (to 
within an order of magnitude, for instance) would be a helpful 
guide to future model building. 

In our simulation, the main effect of the variation of fx is 
a change in the gas temperature, and, thus, the mean 21cm op- 
tical depth. In Figure [3al we plot the mean 21cm optical depth 
as a function of redshift for fx values between 0.01 and 100. 
The thick black line is for fx — 1, as in Figure [Tel The lines 
with higher (lower) optical depth have lower (higher) fx val- 
ues, representing less (more) effective heating. We also plot, in 
Figure[3b] the optical depth in the "forest region" where the non- 
uniform simulation determines the overdensity and ionized frac- 
tion. In this figure, we assume a background source at z = 9. 
The low-frequency cut-off in the forest is due to the frequency 
of onset of 21cm absorption, and the high-frequency cut-off is 
due to the finite extent of the non-uniform simulation region. 
From a close inspection of Figures [2] and [3b] it can be seen that 
the variation in neutral fraction - specifically the presence of 
photoionized bubbles - dominates the spatial variation in the 
21cm optical depth, whereas the density fluctuations are sub- 
dominant. We also see from Figure [3] that varying fx by an 
order of magnitude can have a roughly order-of-magnitude ef- 
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feet on the optical depth. This is due to the fact that the overall 
optical depth is strongly dependent on the spin temperature Ts, 
which varies by roughly an order of magnitude with each factor 
of 10 in f x . 

From Figures [3al and [3bl we see that if the 21cm forest 
is detected in the spectra of high-redshift radio sources, mea- 
surements of the optical depth can constrain the parameter fx 
and thus give us insight into the thermal history of the IGM. 
Against a sufficiently bright source, the 21cm forest spectrum 
can reveal the layout of photoionized bubbles along the line of 
sight (which would appear as transmission windows between re- 
gions of absorption), and, potentially, variations in density and 
ionized fraction. Narrow absorption lines from dense neutral re- 
gions would also be detectable in such observations. Absorp- 
tion lines can have optical depths at levels as high as a percent 
l lFurlanettoir2006al) . but because of their narrowness (typically 
2-3 kHz) their detectability in low mean optical depth regions 
would be highly dependent on their number and density. In prin- 
ciple, fx can also affect the number and depth of absorption 
lines, by altering the background radiation field in which small 
halos or protogalaxies reside. We discuss the implications of ab- 
sorption lines in Section 6. 

For some redshifts and fx values, the mean optical depth 
is so low as to make detection of 21cm forest features difficult 
or impossible, even with future instruments such as SKA. For a 
very radio-loud source at high redshift, a wide range of thermal 
histories will produce a detectable forest signal, and the optical 
depth can be used to constrain fx, but sources with low flux 
density in the frequencies of interest may result in a detection 
only for low values of fx- In this case, a non-detection of the 
21cm forest may provide a constraint on the thermal history. 

In Section|5]and[6]below, we discuss how prospects for the 
detection of the 21cm forest depend upon the source population 
and the thermal history of the IGM, and we consider how the 
presence of narrow absorption lines can complicate the extrac- 
tion of information about the thermal history. Then, in Section 
7.11 we discuss methods for statistical detection in cases where 
the absorption spectrum has low signal-to-noise. 



a cloud of gas with optical depth r: 



5 SOURCE POPULATION AND DETECTABILITY 

The biggest challenge for doing observations of the 21cm forest 
will be finding sufficiently distant, radio-loud sources. The op- 
tical depth to 21cm absorption increases with the redshift, den- 
sity and neutral fraction of the intervening gas, but decreases 
with the gas's spin temperature. Therefore the strongest signals 
will be found in the spectra of very high redshift sources - those 
occurring prior to the epoch of reionization. However, the detec- 
tion of 21cm absorption also becomes more difficult as the radio 
flux density of the source decreases: dimmer sources offer fewer 
photons and the difference between regions of the spectrum that 
are partially absorbed and those that are fully unabsorbed be- 
comes less apparent. 

To quantify this trade-off between the decreasing trend in 
optical depth at lower re dshift and the decreasing source flux 
density at high re dshift, iFurlanetto. Oh & Briggsl d2006h and 
iFurlanettol I l2006ar) have related the minimum flux density Smin 
of a source that could be used to detect with signal to noise S/N 
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The parameters A e // and Av c h are the effective area of the 
telescope array and the channel bandwidth, here taken to be 
those plausible for the SKA, and tint is the integration time. 
The system temperature, T 3ys , is dominated by the Galactic 
synchrotron fore ground at the frequencies of interest. This can 
be approximated lcarillil2006l) as 
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This equation has been used as a proxy for the observability 
of the 21cm forest; in this work, we note that while a source 
above S m in is required for a detailed study of the structure of 
the forest, it does not directly relate to the detectability of the 
presence of absorption or to the mean flux decrement that would 
be present in an IGM with mean optical depth r. We elaborate 
on this point in Sections [6] and[7]below. 

To illustrate the high source fluxes required for detailed 
forest observations to be feasible, we include dashed lines cor- 
responding to Smin values in Figure [3a] which plots the mean 
21cm optical depth as a function of redshift for a range of val- 
ues of fx ■ For each fx curve, if the mean optical depth line lies 
above the S m in line for a given flux density, the signal-to-noise 
for that set of parameters should be at least 5 at that redshift. 
In regions of the plot where the optical depth line lies below 
the S m in line, a detection of the forest at signal-to-noise of 5 is 
not possible with the parameters of the observation that we have 
assumed. 

Equation l[5j illustrates the crucial trade-off in attempts to 
observe the 21cm forest. Successful observations will require 
both a high optical depth to 21cm absorption and a population 
of very radio-loud sources. The former pushes toward high red- 
shift, while the latter is more feasible at low redshift. The key 
issue, then, is whether there exist sufficiently radio-loud sources 
at sufficiently high redshift such that the IGM is still largely 
cold and neutral. If radio-loud sources only occur within or in 
conjunction with massive halos that produce a large amount of 
ionizing radiation, or if early dense neutral regions are not well 
shielded, this trade-off becomes difficult to achieve. 

Estimates of the abundance of radio- loud sources at high 
redshift are uncertain. lHaiman et alj J2004I) present an estimate 
of the number of radio-loud sources per square degree for a 
range of flux densities out to redshift z = 15. According to 
this estimate, in the redshift range 8 < z < 12, there should 
be ~ 2000 sources above ~ 6 mJy in the full sky. However, 
these estimates depend strongly on assumptions about the radio- 
loud fraction and t he spectral steepness of high-redshift sources. 
Ijiang et al.l d2007h have presented evidence that the radio-loud 
fraction may decrease with redshift, which would make discov- 
ery of appropriate 21cm forest background sources less likely. 
In this work, we have shown that objects with 10-100 mJy at 
z > 8 are most favorable as background sources. If they ex- 
ist, they may have al r eady b een discovered in surveys such as 
FIRSlfl. Ilvezic et al.l d2002h have found that ~30% of FIRST 



http://sundog.stsci.edu/ 



© 2011 RAS, MNRAS 000,[T]{9] 



Detecting the 21cm forest 5 



sources at 10-100 mjy have no detected optical counterpart 
within 3" (via SDSS). If optical and infrared observations are 
carried out on these unidentified sources, some may be found to 
lie at high redshift, making them ideal candidates for observa- 
tions of the 21cm forest. We consider the issue of the abundance 
of background sources to be an open question, but we anticipate 
that it may be resolved soon with new observations and contin- 
ued follow-up of known radio-loud sources. 

In order to illustrate the qualitative properties of 21cm for- 
est observations, we will assume a background source at high 
redshift with spectral properties similar to the powerful local ra- 
dio source Cygnus A. The intention in this assumption is not to 
assert that we would aim to carry out these observations against 
sources physically similar to Cygnus A at high redshift, or even 
that such sources necessarily exist, but rather to give a point of 
reference for the brightnesses and qualitative spectral features 
in our example calculations. 

Figures [4a] and [4b] use the spectrum of our hypothetical 
Cygnus A-like source to illustrate in a different way the trade- 
off between optical depth and source brightness. For fx=(0, 
0.01, 0.1, 1), we plot as a function of redshift the minimum flux 
density Smin required for an SKA-like instrument to detect the 
21cm forest at signal-to-noise of 5. We assume frequency res- 
olution Ai/ C h = 1 kHz and integration times tint of one week 
and one year, respectively. On the same axes, we also plot the 
flux density of a Cygnus A-like source as it would be observed 
at that redshift, at an observing frequency corresponding to the 
rest-frame 21cm line. In this plot, if the S m in curve for a par- 
ticular value of fx is below the Cygnus A flux density curve, 
then a source similar to Cygnus A would be sufficiently bright 
to allow the detection. We see that for a week-long integration, 
detection is possible only if fx < 0.01, even for a source exist- 
ing at redshift as high as z = 10. For the same observation with 
a year-long integration, the example radio source would be suf- 
ficiently bright to allow detection for fx = 0.1, but the f x = l 
scenario still does not provide sufficiently high optical depth. A 
similar exercise for Av c h — 100 kHz (which reduces S m in by 
an order of magnitude in each case) shows that for fx = 1, a 
week-long integration would still be insufficient, but a year-long 
integration could detect the forest at high significance against a 
Cygnus A-like source at z > 7. 



6 SIMULATED SPECTRA 

In order to simulate a realistic 21cm forest measurement, we 
have produced synthetic spectra for absorption against a hypo- 
thetical Cygnus- A-like source placed at z = 9, with added noise 
appropriate t o an observat i on wit h the SKA, based on noise es- 
timates from ICarilli et alJ d2004h . For a spectrum with 1 kHz 
frequency resolution and a week-long integration, we assume 
Gaussian noise at a level of a « 41/xJy. In Figures [5all5bl and 
l5c] we plot the 21cm absorption spectrum with noise included 
for x-ray heating with fx = 1, 0.1, and 0.01, respectively. 

Although the absorption due to the 21cm forest is marked 
in the fx = 0.01 scenario, it is difficult to see by eye in the 
fx =0.1 spectrum and is visually indistinguishable from noise 
for fx = 1. In this simulation, we have included only mild 
overdensities with S ~ 1, and the prescription used to simulate 
the ionization state of the gas results in an ionization fraction 
correlated with density. Therefore there are no highly overdense 
neutral regions included here. 



In works by other authors [e.g., ICarilli et al.l d2004l) ; 
IXu et al.1 ( 12009k dense neutral regions produce sharp absorp- 
tion spikes in the 21cm forest spectrum. Our simulations do 
not include these regions (protogalaxies or minihalos - the pro- 
genitors of damped Lyman-a systems). As discussed in Sec- 
tion [3] above, the estimates of the number and character of 
such absorbers are highly uncertain, and their dependence on 
the state of the mean IGM (which is our main interest here) 
is not straightforward. However, we also recognize that the in- 
clusion of such systems might be important in the analysis of 
the spectra. Therefore, we have developed a stochastic method 
to produce absorption features "by hand" which we then in- 
sert into the spectra to simulate the effect of these narrow 
absorption lines. We produce N spectral lines [where N — 
(5, 10, 50, 100, 200, 300, 400, 500)] by drawing fro m a distri- 
bution of optical depths based on the calculation in iFurlanettcl 
J2006ah and a linewidth of 2-3 kHz. After including instrumen- 
tal noise, we randomly place these lines in the spectra in the 
non-uniform simulation region between 142 and 148 MHz (cor- 
responding to z = 8 — 9). In order to take into account the 
fact that small halos would likely be photoevaporated in highly 
ionized bubbles, we throw out any lines that are assigned to 
high-ionization regions. The resulting spectra are not meant to 
be quantitative predictions of future observational results, but 
rather to provide a plausible basis for the determination of the 
study's feasibility and specifically to show how the presence of 
absorption lines would affect estimates of fx based on the study 
of the spectra. 

In Figure [6] below, we show simulated spectra for fx = 
0.1 with 100 and 500 absorption lines added, in[6a]and[6b] re- 
spectively. Note that since lines randomly assigned to ionized 
regions are not included, the number of lines appearing in the 
spectrum is somewhat smaller than the number of added lines 
(around 3 A of the lines remained in these cases). Comparing Fig- 
ure [6] with Figure l5bl we see that when absorption lines are in- 
cluded, while the flux decrement and bubble structure are still 
present, the most prominent feature of the spectrum becomes 
the forest of absorption lines. 



7 STATISTICAL DETECTION METHODS 

Here we present two methods to derive an estimate of fx from 
the 21cm forest spectrum. In the first method, we show that the 
structure of transmission regions in the spectrum (due to pho- 
toionized regions in the line of sight) leads to an increase in the 
spectrum's variance in the absorbed region. The relative flux be- 
tween the absorption and transmission regions is determined by 
the mean optical depth, which depends on the gas temperature, 
which itself depends on fx. Therefore fx can be determined 
by the degree to which the spectrum's variance increases in the 
region blueward of the redshifted 21cm transition. This method 
is most appropriate when there are no strong absorption lines in 
the spectrum. 

The second method can be used even when many strong 
absorption lines are present. In this method, the spectrum is 
"normalized" (source-subtracted) and divided into the absorbed 
and unabsorbed regions (blueward and redward of the 21cm 
transition, respectively). The flux distribution in each region is 
fit to a Gaussian, from whose shape we can derive properties of 
the IGM. The anomalously low fluxes in pixels corresponding 
to narrow absorption lines cause them to be outliers in the flux 
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distribution, which effectively allows us to subtract them from 
the signal of the "typical" IGM that would be seen if no lines 
were present. Thus they do not hinder our ability to determine 
the properties of the IGM, which is our main interest in this 
work. 

In an actual observation, the intrinsic spectrum of the 
source would not be known - it might instead be approxi- 
mated by a powerlaw of unknown slope. Bright radio fore- 
grounds would also complicate the subtraction. Since the fre- 
quency range of the measurement is much smaller than the cen- 
tral frequency (8v/v <C 1), in practice it may be possible to ac- 
curately extrapolate the foreground and continuum component 
from the unabsorbed side to the absorbed side of the spectrum. 
We therefore consider our results utilizing the source-subtracted 
(normalized) spectrum to be a best-case scenario. In particular, a 
flux decrement measurement (see Section lT2t over a wide band 
would be highly dependent on accurate foreground and source 
subtraction. However, estimates of the variance discussed in 
Sections [7,ll and [T2l below are not subject to this continuum un- 
certainty, as the width of the spectrum's flux distribution should 
be robust even if its normalization is not known precisely. 

In Sections [7.1l and lT2l below, we describe these statistical 
detection methods in more detail and show how they can be 
used to study the thermal history of the IGM in cases where the 
features of the 21cm forest cannot be studied at high signal-to- 
noise. 



7.1 Variance increase in absorbed region 

As can be seen in the fx = 0.01 case without absorption lines 
included (Figure[5cl>, the most striking features of the spectrum 
are the step-like transitions between transmission through pho- 
toionized bubbles and absorption in largely neutral regions. The 
detection limits discussed in Section [5] can be stated in terms 
of the minimum flux density Smin of a background source that 
would allow a clear detection of an absorption feature. Individ- 
ual absorption features can be seen clearly in Figure [5c] as can 
the "DC offset" or flux decrement in the absorption region, but 
in Figure [5b] the flux decrement is less apparent, and in Figure 
[5a] the offset cannot be seen by eye. However, one can see in 
Figure [5b] (and to a much greater extent in Figure [5ct that the 
variance (or apparent noisiness) of the spectrum increases at the 
onset of 21cm absorpt ion, around 142 M Hz. This increase in 
variance, discussed bv lCarillietaT1 l l2004h . may be used to de- 
tect the 21cm forest statistically in spectra for which the signal- 
to-noise is low. 

Using our synthetic 21cm forest spectra, we investigate 
the effectiveness of the statistical detection technique. After 
smoothing the spectrum to a given frequency resolution Av c h, 
we add Gaussian noise to each pixel in frequency space and 
then subtract the input source spectrum. The source-subtracted 
spectrum is then broken into regions of width w, and in each 
of these windows, the variance of the spectrum is calculated. 
For a spectrum with no absorption, as is seen in frequencies 
redward of the source redshift, the variance should be equal to 
the variance of the input Gaussian noise, a 2 . Blueward of the 
frequency of the onset of absorption, the variance increases as 
the absorption level varies due to density and ionization-state 
fluctuations in the intervening IGM. The largest variances oc- 
cur between regions of maximal absorption (from substantially 
neutral clumps of gas) and regions of no absorption (photoion- 



ized bubbles). Therefore a very small w (of order the frequency 
resolution Au c h) will pick up little variance above that of the 
Gaussian noise, since the variance within the window comes 
primarily from the signal variance due to noise. Windows of 
order the size of photoionized bubbles in frequency space will 
have the maximum variance, as they will generally cut across re- 
gions of maximal and minimal absorption. As the window size 
increases further, the variance in the forest region will eventu- 
ally level off as a large number of bubbles are averaged over and 
the mean optical depth drops at high frequencies. 

To illustrate this, we calculate the mean variance in win- 
dows of different widths, w, on either side of the onset of 
21cm absorption. For absorption against a hypothetical radio- 
loud source at z = 9, we can split the spectrum roughly into 
the pre-absorption and post-absorption regions by comparing 
the signal variance in the region from 130 to 140 MHz to the 
signal variance in the region from 140 to 150 MHz. The mean 
signal variance (erf) in a window of width w is the mean of 

^ = ( F *r,» - ( F *,i)) 2 (7) 

where uo is the starting frequency in window i, Fn, v indicates 
the source-subtracted flux at frequency v and (Fn,%) is the mean 
source-subtracted flux across window i. The window sizes con- 
sidered range from 50 kHz to 5 MHz, and we look specifically 
at models with fx = 0.1 and fx = 1 in order to show how 
the variance increase can improve the detectability of the 21cm 
forest in cases without a strong DC offset. 

In Figures[7a]and[7b] we plot the mean binned signal vari- 
ance as a function of frequency for windows of different sizes, 
for fx = 0.1 and fx = 1, respectively. In the fx = 1 case, 
the increase in variance at the onset of absorption is not ap- 
parent by eye for any of the window sizes. In the fx =0.1 
case, on the other hand, the effect of the photoionization bub- 
bles is clear in the increase in variance. For small window sizes 
(w = 50 — 100 kHz), the increase in variance at the onset of 
absorption is difficult to discern, as the scales probed are too 
small to pick up typical photoionization bubbles, and the vari- 
ance is similar to that of the Gaussian noise (a 2 « 1.7 mJy 2 ). 
For larger window sizes (w = 0.5 — 5 MHz), however, there is a 
clear increase in the mean variance after the onset of absorption 
at v fts 142 MHz. It is important to note that the curves for the 
larger window sizes are not simply a smoothed version of the 
curves for smaller windows: except in the pre-absorption region 
where there is no 21cm signal, the 5 MHz curve would fail as a 
fit to the 50 or 100 kHz curves. This is because the larger win- 
dows are seeing more of the photoionization bubble structure, 
whereas the smaller windows are largely missing it and picking 
up primarily the Gaussian signal variance on top of the (rela- 
tively weak) optical depth variations due to mild overdensities 
and regions of fractional ionization. 

For the purpose of this calculation, we have defined the un- 
absorbed region of the spectrum to be the frequency range 1 BO- 
MO MHz and the absorbed region 140-150 MHz [though in an 
actual observation, if we know the redshift with high accuracy 
we can obtain greater precision by using 1420 MHz x(l + z) 
as the dividing line]. In each region, we take the mean of the 
variance and calculate the difference between the means in the 
two regions. In Figure [8] we plot this difference as a function 
of integration time for a window of size 1 MHz and fx = 1. 
For comparison, we also include the difference for fx = 0.1 
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at an integration time of 1 week. This plot shows that for small 
integration times, the variance method still will not allow us to 
detect the forest against a Cygnus-A-like source for fx = 1. 
However, increasing the integration time can allow a significant 
detection even for this case. (Compare with Figure l4bl) This 
suggests that even if fx is high and very radio-loud sources are 
not available for high signal-to-noise in spectral measurements 
of the 21cm forest, a detection of absorption (and therefore a 
constraint on fx) may be possible with extended observations. 

7.2 Gaussian fitting 

In spectra with numerous strong absorption lines, the signal 
variance is dominated by the lines, making the statistical method 
described in Section [77X1 above more difficult. In order to take 
the variance due to lines into account, one would have to have a 
detailed model of the way in which fx affects the number and 
character of absorption lines. Since such models are still highly 
uncertain, we developed a method to effectively remove the ab- 
sorption lines from the spectrum in order to better characterize 
the thermal properties of the mean IGM. 

Our method uses Gaussian fitting essentially as a filter to 
remove highly non-Gaussian elements (i.e., narrow absorption 
lines). First, we source-subtracted the spectra as discussed in 
Section lTTI For a spectrum with no absorption lines added, this 
results in a "normalized" spectrum in which the unabsorbed re- 
gion consists of Gaussian noise with a mean of zero and the 
absorbed region contains both Gaussian noise and a mildly non- 
Gaussian distribution of absorption and transmission regions 
with a mean below zero - this mean is the flux decrement. We 
reiterate that this depends strongly on how perfectly the source 
spectrum can be subtracted in practice. 

Let Fo(v) be the flux density of the source as a function of 
frequency. In a pixel at some frequency Vo, the observed flux is 

F(vo) = F (vo)T(uo) « FoM[l - r(u )] (8) 

where T(y) is the transmission fraction at frequency v and 
t(v) < 1 is the corresponding optical depth. If we source- 
subtract the spectrum, we have 

F N (v ) = -FoOoM^o) (9) 

where Fn indicates the source-subtracted (normalized) flux. 
For the unabsorbed region in which r = (i.e., redward of the 
redshifted 21cm frequency), Fn — on average, the only vari- 
ation being due to instrumental noise, which we assume here 
to be Gaussian. We denote the average flux of the unabsorbed 
region on the left-hand side of the spectrum Fl and the aver- 
age flux of the absorbed region on the right-hand side Fn. For 
a small frequency range around the 21cm transition, (Fl) = 
and 

(F R ) = -(F ,r)(tr). (10) 

This defines the average flux decrement AFn of the absorption 
region: 

AF N = (F R ) (11) 

Note that we have chosen a convention in which the flux decre- 
ment is a negative quantity - strictly the (negative) distance from 
Fn — rather than the (positive) distance below Fn = 0. 

If no lines are present in the spectrum, the mean normal- 
ized flux in the absorbed region gives the flux decrement, which 



can be related in a straightforward way to the optical depth and, 
consequently, fx- However, if absorption lines are present in 
the spectrum, they can distort the measured average flux decre- 
ment, making it difficult to straightforwardly relate it to the 
mean r in the IGM. Similarly, absorption lines dominate the 
signal variance. Gaussian fitting solves this problem. Instead 
of calculating the mean or standard deviation of the flux dis- 
tribution itself, we make a histogram of the flux distribution of 
each region (absorbed and unabsorbed) and fit this to a Gaus- 
sian. The mean and standard deviation of the Gaussian fit are 
close to those that would be found if the lines were not included 
in the spectrum, since the lines constitute non-Gaussian outliers 
that, due to their narrowness and the large region of the spectrum 
considered, have little weight in the fitting calculation. In Figure 
[9] we show example histograms and Gaussian fits for the unab- 
sorbed region of the spectrum as well as the absorbed region 
with and with 100 lines added. In each plot, we also include: 
/^Gaussian ^Gaussian' me mean an d standard deviation of 
the Gaussian fits, and /^ata and c-(j ata , the mean and standard 
deviation of the simulated spectrum "data." We also include the 
skewness of the simulated spectrum points. The non-Gaussian 
outliers have a strong influence on the skewness of the distri- 
bution; in this case, a large (negative) skewness indicates the 
presence of many absorption lines. In Figure [9c] the range of 
the plot extends far to negative flux values to include the bins in 
the histogram corresponding to the absorption lines. As is evi- 
dent in this plot, while the absorption lines extend to low fluxes, 
their distribution is such that they cannot be seen by eye in the 
histogram and they do not significantly affect the location of the 
Gaussian fit. 

In Figures[lO][TT]and[T2] we show the Gaussian fit-derived 
flux decrement, the difference between the absorbed and unab- 
sorbed fit-derived standard deviation, and the simulation points' 
skewness as a function of the number of lines added. In each 
plot, blue squares indicate fx = 1, black diamonds indicate 
fx = 0.1, and a red line is included at for reference. For 
Figures [Tol and [TT1 we estimated the errors of the Gaussian fit 
parameters using bootstrap resampling of the spectrum points. 

Figure [10] shows that, in the idealized case in which fore- 
grounds and the source spectra are well understood, this method 
can very accurately determine fx- The robustness of the flux 
decrement estimated by the Gaussian fit - regardless of the num- 
ber of lines included - is striking. We see in the plot that even 
for fx = 1, the flux decrement is easily distinguished from 
0, making a detection of 21cm absorption and an estimate of 
fx possible. Even if there are 500 lines added to the absorption 
spectrum, the flux decrement can be translated into a high pre- 
cision estimate of r via equation tlOt . For fx = 0.1, the flux 
decrement is similarly stable and is detectable at a much higher 
significance. The strong separation between the fx = 1 and 
fx = 0.1 points in Figure [Tol further suggests that fx might 
potentially be determinable to closer than an order of magni- 
tude if the foregrounds and spectra were well known. 

Figure[JTJplots or — ctl where or is the Gaussian-derived 
standard deviation in the absorbed region and ol is the value in 
the unabsorbed region. It is clear from this figure that while the 
outlying absorption feature points do not significantly move the 
Gaussian, they do somewhat broaden it (though not significantly 
enough that an fx = 1 spectrum with many lines would be mis- 
taken for an fx =0.1 spectrum with few lines). This measure- 
ment should be possible even in a realistic observation in which 
the foregrounds and source spectrum had to be approximated 
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with a running powerlaw and extrapolated from the unabsorbed 
region, as the relative widths of the distributions should still be 
robust. 

Figure[T"2lpresents the skewness of the simulation distribu- 
tion as a function of lines added. As can be seen in the plot, a 
large negative skewness occurs when large numbers of absorp- 
tion lines are present. The skewness is not monotonic, however. 
This is due to the fact that the presence of more absorption lines 
means there are fewer channels that contribute to the roughly 
Gaussian part of the flux distribution, and although the distribu- 
tion is more heavily skewed at the low-flux end, it also becomes 
broader and less Gaussian overall. Since the standard deviation 
contributes inversely in the definition of skewness, the magni- 
tude of the skewness decreases as the number of outliers grows 
very large. 



8 CONCLUSIONS AND OUTLOOK 

We have shown that the magnitude and statistical significance 
of the 21cm forest signal is highly sensitive to assumptions 
about the thermal history of the IGM, and that the detectability 
of the signal also depends strongly on the redshift and radio- 
loudness of the background source. The strongest signal can 
be seen when x-ray heating is relatively inefficient (as param- 
eterized by low values of the x-ray efficiency fx, described in 
Sections[3]and|4j and when the source is very distant and radio- 
loud. For example, if the IGM is heated in a process for which 
fx = 1, a background radio source with flux density S > 100 
mJy at redshift z > 10 would be needed for 21cm forest fea- 
tures to be significantly detected within a week of integration on 
the SKA. If x-ray heating is relatively inefficient, however, and 
fx ~ 0.01, the same signal-to-noise could be achieved with a 
source of 10 mJy at redshift z ~ 9. In any case, it is clear that 
radio-loud sources at high redshift must be found for the 21cm 
forest signal to be observed. 

This conclusion is based on a relation [equation l[5}] com- 
monly used to quantify the detectability of the 21cm forest. 
However, we point out that the relation applies specifically to 
attempts to study the features of the 21cm forest at high signal- 
to-noise. In this study, our main interest is in using the 21cm 
forest to study the thermal history of the high-redshift IGM. 
We find this can be accomplished through the use of statisti- 
cal methods that allow a high-significance detection of 21cm 
absorption against objects with lower flux densities or redshifts 
than those suggested by equation l(5j- This suggests that the de- 
tection of 21cm absorption may be less challenging than previ- 
ously thought. 

One of the statistical methods we discuss, which is appli- 
cable when strong absorption lines are not present, makes use 
of the increase in variance in the source spectrum at the fre- 
quency at which 21cm absorption begins. The passage of the 
radio signal alternately through regions of high ionization (HII 
regions produced by photoionization from bright UV sources) 
and through regions of relatively neutral gas (which can strongly 
absorb 21cm radiation) results in a pattern of high and low trans- 
mission and thus significantly increases the variance of the sig- 
nal above the noise in certain cases. The measurement of this 
effect may allow us to detect 21cm absorption and place limits 
on the thermal history of the IGM even in cases where the de- 
tection of the forest features is not significant. A combination of 
long integration times and variance studies can therefore allow 



us to extract information from the 21cm forest even if the IGM 
is relatively hot and if bright high-redshift sources are rare. 

In the case that strong absorption lines are numerous, the 
lines dominate the spectrum's variance in the absorbed region, 
making the use of the variance method less straightforward. Al- 
though an increase in varia nce can signal that absorption is oc- 
curring dCarilli et alj|2004l) . the uncertainties in modelling the 
number and character of absorption lines make it difficult to di- 
rectly link the level of variance increase with the thermal prop- 
erties of the high-redshift IGM. A measurement of the aver- 
age flux decrement in the absorbed region would similarly be 
strongly affected by the properties of the absorption lines. 

In Section 17.21 we present a second method that can be 
used to extract thermal information about the IGM even if a 
spectrum contains many strong absorption lines. The method 
effectively removes absorption lines based on their narrowness 
and non-Gaussian nature, by fitting the source-subtracted flux 
distribution across a wide band to a Gaussian. We show that the 
flux decrement determined with this method can be detected at 
high significance even for observational scenarios far less favor- 
able than those derived through a naive application of equation 
l[5}. The estimates here should be considered a best-case sce- 
nario, due to the difficulty of accurately source-subtracting the 
spectrum in the presence of foregrounds. Equation {5} suggests 
that an SKA-like observatory looking for absorption against a 
z = 9 source with a radio flux similar to Cygnus A would re- 
quire nearly a decade of integration time for a detection of the 
21cm forest. By contrast, our method shows that a significant 
detection of the flux decrement - and therefore a measurement 
of the optical depth and a constraint on fx - could, in principle, 
be achieved with an integration time of less than 1 week for the 
same object and observatory. Even without perfect source sub- 
traction, the width of the flux distribution over a wide band can 
be used to clearly distinguish amongst different values of fx- 
Note that this does not depend st rongly on properties of the ab- 
sorption lines [in contrast to, e.g. lXu. Ferrara & Chen|j2~01lh l. 

This has implications for near-term reionization-era ob- 
servational efforts. Our result shows that high-significance de- 
tections may be possible with instruments such as LOFAR or 
MWA, given a sufficiently radio-loud and distant source and a 
good understanding of foregrounds and source spectra. Looked 
at a different way, this method could make it possible for the 
21cm forest to be detected against lower-redshift or dimmer 
sources with the SKA. 

In this work, we aimed to present the 21cm forest signal 
produced by the mean evolving IGM and the structure of large- 
scale photoionization along the line of sight. While this study 
presents an outline of the feasibility and major challenges of fu- 
ture 21cm forest measurements of the IGM, another potential 
extension of this work would be to use a more detailed simu- 
lation of the IGM properties and intervening absorption lines, 
perhaps from a large-scale numerical simulation including real- 
istic dark matter and baryonic physics and radiative transfer, to 
more precisely simulate the expected signal in a high-redshift 
radio source. Planned future work also involves a more detailed 
treatment of the scope of this method in light of the capabil- 
ity of near-term observatories, and the possibility of fitting for 
the reionization model as part of the analysis. Another crucial 
element of any estimate of the utility of 21cm forest studies is 
an accurate estimate of the number of high-redshift radio-loud 
sources that will be available in the era of the SKA. We have 
given a brief overview in Section[5]of recent estimates, but up- 
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dated theoretical studies and more observational follow-up of 
known radio-loud sources would both strongly impact the case 
for pursuing 21cm forest observations as a diagnostic tool for 
studying the reionization-era IGM. 
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Figure 1. Uniform IGM simulation output plots. |(a)| Mean IGM ionized 
fraction Xi as a function of redshift. [(b)] Gas kinetic temperature (solid 
line) and CMB temperature (dotted line) as a function of redshift. |(c)| 
Mean 21cm optical depth as a function of redshift. The model shown 
here has fx = 1 (see Section|4j. 
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Figure 2. Non-uniform IGM simulation output plots. |(a)| Overdensity S 
and [(b)] neutral hydrogen fraction xjjj in the non-uniform simulation 
re gion as a function of re dshift, from a two-phase IGM code developed 
by iGeil & Wvifhd (2008[). The size of the simulation box is 70 cMpc 
cubed, with 360 pixels per sidelength. Multiple slices through the sim- 
ulation box are used, giving a total of 280 cMpc. 



f x =0 (no x-ray heating) 
f x =0.01 
1=0.1 

x f x =1 

f x =10 
f x =100 




1e-06 



1e-07 



140 145 150 155 160 

observed frequency (MHz) 

(b) 

Figure 3. 21cm absorption optical depth. [(a)] Mean 21cm optical depth 
r as a function of observed frequency, for varying fx values: from top 
to bottom curve, fx = (no x-ray heating), 0.01, 0.1, 1 (thick black 
line), 10, 100. Also included are curves of Smin (dashed purple lines) 
as defined in equation {5). These lines indicate, for each value of r on 
the left-hand axis, the minimum observed flux density of a source that 
would allow a detection of absorption at signal-to-noise of 5, assum- 
ing an array with effective area A e ff = 10 6 m 2 , frequency resolution 
Af c h = 1 kHz, system temperature given by equation {6), and integra- 
tion time t-i n t = 1 week. The redshift dependence comes from the fre- 
quency dependence of the system temperature. Where the mean optical 
depth lines (solid) cross above the Smin lines (dashed), the absorption 
is detectable at that redshift. [(b)| 2 1 cm forest optical depth as a function 
of redshift, for varying fx values: from top to bottom curve, fx = 
(no x-ray heating), 0.01, 0.1, 1 (thick black line), 10, 100. 
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Figure 4. Minimum flux density curves for S/N > 5 detection of the 
21cm forest for f x = (0, 0.01, 0.1, 1) (solid lines), with the redshifted 
21cm flux density of a Cygnus A-like source also plotted as a function 
of redshift (dashed line). The instrument is assumed to be SKA with 
channel width of 1 kHz and integration time of |(a)| 1 week and |(b)| 1 
year. In |(b)| the fx = line is below the limits of the plot. 
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Figure 5. Simulated spectrum for 21cm absorption against the spec- 
trum of a radio source at z = 9 with spectral properties similar to that 
of Cygnus A. The x-ray efficiency factors in these simulations are |(a)| 
fx = l, |(b)| /x = 0.1 and |(c)| /x = 0.01. Measurement noise ap- 
propriate to observation with SKA, with Ai/ c /j = 1 kHz and ti„t = 1 
week is included. In this simulation, overdense neutral regions (e.g., 
minihalos or protogalaxies) along the line of sight are not included. 
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Figure 6. Simulated spectrum for 21cm absorption against the spectrum 
of a radio source at z = 9 with spectral properties similar to that of 
Cygnus A, with |(a)| 100 and |(b)| 500 absorption lines added to simulate 
the presence of overdense neutral regions along the line of sight. The 
x-ray efficiency factor in these simulations is fx = 0.1. Observational 
parameters are as in Figure[3]above. 
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Figure 7. Mean binned signal variance as a function of frequency for 
windows with w = (50 kHz, 100 kHz, 500 kHz, 1 MHz, 5 MHz), for 
|(a)| fx = 0.1 and |(b)| fx = 1. No absorbers are included in this 
simulation. 
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Figure 8. Difference in mean variance between unabsorbed and ab- 
sorbed regions as a function of integration time for a window size of 
1 MHz, with fx = 1 and no lines added (red symbols with error- 
bars). Also included, for reference, is the difference in mean variance 
f° r fx = 0.1 with a window size of 1 MHz, for an integration time of 
1 week (blue cross with errorbars). 
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Figure 9. Gaussian fit (blue curve) to histogram of normalized flux den- 
sities from fx = 0.1 spectrum for the [(a)] unabsorbed and |(b)| absorbed 
region of the spectrum with no absorption lines added, as well as the |(c)| 
absorbed region with 100 lines added. The mean and standard deviation 
of both the spectrum points ("data") and Gaussian fit are included on 
the plots, as is the skewness of the spectrum points in each region. Note 
that in |(c)| the x-axis range has been extended to reflect the fact that 
the histogram extends to large negative values of the normalized flux, 
though the number of points in each low-flux bin is too small to be seen 
in the plot. 
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Figure 10. Difference between the Gaussian mean in the absorbed 
region and unabsorbed region of the spectrum (flux decrement) for 
fx = 1 (blue squares) and fx =0.1 (black diamonds), plotted against 
the number of absorption lines added to the spectrum. Error bars are de- 
rived from a bootstrap resampling of each spectrum. 
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Figure 11. Difference between the Gaussian standard deviation in the 
absorbed region and unabsorbed region of the spectrum for fx = 1 
(blue squares) and fx = 0.1 (black diamonds), plotted against the 
number of absorption lines added to the spectrum. Error bars are derived 
from a bootstrap resampling of each spectrum. 
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Figure 12. Skewness of the distribution of the normalized flux values in 
the absorbed region for fx = 1 (blue squares) and fx =0.1 (black 
diamonds), plotted against the number of absorption lines added to the 
spectrum. 
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